International  Journal  of  Trend  in  Scientific 
Research  and  Development  (IJTSRD) 
International  Open  Access  Journal 


♦ . 

ISSN  No:  2456  -  6470  |  www.ijtsrd.com  |  Volume  -  2  |  Issue  -  4 


♦ 

♦ 


NOSQL:  A  Very  Dynamic  Approach  for  Managing  Big  Data 


Assira  Amin 

M.Tech  Scholar,  Department  of 
CSE’  Sat  Kabir  Institute  of 
Technology  &  Management, 
Haryana,  India 


Mudasir  Ahmed  Muttoo 

Assistant  Professor,  Department  of 
CSE’  SSM  College  of  Engineering 
&  Technology, 

Kashmir,  India 


Kirti  Bhatia 

Assistant  Professor,  Department  of 
CSE’  Sat  Kabir  Institute  of 
Technology  &  Management, 
Haryana,  India 


ABSTRACT 

Database  Management  System  is  very  dynamic  and 
popular  subject  in  Computer  Science  and  Information 
Technology.  Modem  society  is  based  on  ‘e’  concept 
means  every  task  is  achieved  by  interactive 
applications  like  e-commerce  applications  and  the 
production  of  data  is  more  and  more.  The  present 
digital  world  becomes  more  complex  with  Big  Data 
means  the  volume,  velocity  and  variety  of  data.  The 
Big  Data  is  a  global  system  and  is  considered  a  data 
collection  that  increases  so  large  and  can’t  be 
managed  efficiently  by  traditional  databases 
management  systems.  The  aim  of  this  paper  is  to 
identify  the  new  data  management  strategies  and  how 
to  tackle  Big  Data  through  a  new  breed  of  databases 
commonly  known  as  NOSQL  databases. 
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INTRODUCTION 

Database  management  system  is  a  methodology  for 
storing  and  accessing  data  efficiently.  Various  models 
have  been  proposed  by  experts  time  by  time  for  data 
management  and  the  relational  model  is  the  most 
popular  and  adoptable  by  professionals.  Modem 
applications  based  on  social  networking  and  business 
intelligence  changes  the  data  store  strategy  to  a  great 
extent.  The  traditional  databases  are  not  efficient  for 
handling  the  Big  Data  that  is  the  management  of  data 
in  terms  of  volume,  variety  and  velocity  of  data. 
NOSQL  databases  are  very  good  alternative  for 


managing  the  Big  Data.  NOSQL  is  a  group  of  non¬ 
relational  database  management  systems,  not  mostly 
based  on  tables  and  do  not  use  SQL  for  data 
manipulation.  NOSQL  databases  are  very  efficient 
specially  handling  huge  volumes  of  data.  NOSQL  is 
based  on  distributed  concept  for  huge  data  storage  and 
supports  parallel  processing  of  data  across  a  large 
number  of  commodity  servers. 

Big  Data  characteristics 

Big  Data  is  not  only  size  but  also  includes  the  variety 
and  velocity  of  data. 


Velocity 


The  volume  of  data  in  an  organization  grow 
regardless  of  the  size  because  there  is  a  need  to  store 
data  of  all  varieties  like  medical  data,  financial  data 
etc.  Today  data  of  organisations  are  in  the  terabyte 
range  but  soon  can  move  to  petabytes  or  exabytes. 
Data  in  an  enterprise  also  travel  from  various  sources 
in  different  ways  like  use  of  sensors,  deployment  of 
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smart  devices  and  the  most  important  social 
networking.  The  enterprise  data  not  only  contain 
structured  data  but  also  un-structured  and  semi- 
structured  data  thus  making  enterprise  data  more  and 
more  complex. 

Structured  Data 

Structured  one  represents  data  that  is  grouped  into  a 
relational  format  means  in  terms  of  rows  and  columns 
within  a  database  and  respond  to  the  simple  queries 
based  on  parameters  and  operational  requirements. 

Semi-structured  Data 

Semi-structured  data  is  form  of  structured  one  and  not 
validate  against  a  fixed  schema.  The  data  is  self  - 
describing  and  contains  tags  to  enforce  hierarchies  of 
fields  and  records. 

Un-Structured  Data 

This  type  includes  data  formats  which  are  not  indexed 
into  relational  tables  for  querying  like  audio,  video 
and  image  files. 

The  velocity  is  also  one  of  the  important  characteristic 
of  Big  Data.  Velocity  simple  means  how  fast  the  data 
appear,  stored  and  retrieved.  The  value  of  Big  Data  is 
extracted  by  enterprises  only  after  tackling  the  3 Vs 
means  volume,  variety  and  velocity  of  data. 

NOSQL  for  Big  Data 

DataStax  is  the  leading  provider  of  efficient  database 
technologies  and  services  and  is  very  famous  for 
providing  best  options  for  handling  the  Big  Data.  The 
foundation  for  DataStax  for  handling  Big  Data  is  the 
power  of  Apache  Cassandra  and  is  the  only  NOSQL 
database  technology  that  truly  handles  the  modem 
trend  of  data  management  commonly  known  as  Big 
Data.  The  key  technical  aspects  of  Cassandra  NOSQL 
database  technology  are  described  as  under: 

■  Cassandra  can  store  data  even  in  petabytes  and 
support  thousands  of  concurrent  users  per  second 
with  easiness. 

■  Supports  peer-to-peer  design  that  results  in  no 
single  point  of  failure  for  any  process  of  database. 

■  Cassandra  is  based  on  location  independent 
concept  means  data  can  be  written  and  read  from 
anywhere. 


■  RDBMS  like  data  durability  and  protection 
because  support  tunable  data  consistency. 

■  Cassandra  supports  flexible  design  means  support 
all  Big  Data  all  formats  including  structured, 
semi-structured  and  un-structured. 

■  The  replication  is  very  simple  and  supports  multi 
data  center  and  is  very  efficient  for  cloud 
environment. 

■  Very  comfortable  database  for  leading 
programming  languages. 

■  Cassandra  runs  efficiently  on  commodity 
hardware  means  no  need  of  additional  hardware. 

Theses  aspects  make  Cassandra  the  best  choice  for 
enterprises  specially  for  handling  Big  Data. 

CONCLUSION 

The  database  is  a  very  important  part  for  any 
application.  The  modem  applications  based  on  social 
networking  and  business  intelligence  change  the  data 
storage  concept  to  a  great  extent.  The  volume  of  data 
in  organizations  grows  rapidly  up  to  petabytes  or  even 
exabytes  and  is  very  difficult  for  traditional  database 
systems  to  handle.  The  NOSQL  is  a  new  open  source, 
distributed  data  storage  concept  that  is  very  efficient 
in  terms  of  handling  the  huge  volume  of  data.  The  Big 
Data  refers  not  only  the  volume  of  data  but  also  the 
variety  and  the  velocity  of  data. 
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